SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Dertzbaugh, Mark T. 

(ii) TITLE OF INVENTION: PROTECTIVE PEPTIDES OF NEUROTOXIN OF C. 
BOTULINUM 

(iii) NUMBER OF SEQUENCES: 22 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Hendricks and Associates 

(B) STREET: P.O. Box 2509 

(C) CITY: Fairfax 

(D) STATE: VA 

(E) COUNTRY: US 

(F) ZIP: 22031 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Pa ten tin Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 09/917,791 

(B) FILING DATE: 31-JUL-2001 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Hendricks, Glenna 

(B) REGISTRATION NUMBER: 32,535 

(C) REFERENCE/DOCKET NUMBER: der09/917 , 791 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 703-425-8405 

(B) TELEFAX: 703-425-8406 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 




(iv) ANTI -SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: C. botulinum 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
ATATGGAATT CGTTAATAAA CAATTTAATT ATAAAGATCC 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: C. botulinum 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
AGTATCGTCG ACTTTTAATT CTGTATCTAT TGTACTTCCA CC 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: C. botulinum 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
GATACAGAAT TCAAAGTTAT TGATACTAAT AG 
(2) INFORMATION FOR SEQ ID NO: 4: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: C. botulinum 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 
CTTTGCGTCG ACTCCCCCAA ATGTTCTAAG TTCC 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: C. botulinum 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5 
GGGTTAGAAT TCAGCTTTGA GGAACTTAGA ACATTTGGG 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: C. botulinum 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
AGGACTGTCG ACCAAGTCCC AATTATTAAC TTTGATTGAT AAATC 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: C. botulinum 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
TTAAATGAAT TCTCAATCAA AGTTAATAAT TGGGAC 
(2) INFORMATION FOR SEQ ID NO: 8: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: C. botulinum 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
CTCTGGGTCG ACTTCTAACA GAATAACAGC TCC 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: C. botulinum 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
GAAGTAAGAG CTCTGGATAA AATTGCGGAT ATAAC 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

I 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: C. botulinum 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 
GCTATGATTA ATATAAATAA ATTTTTGAAT CAATGC 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: C. botulinum 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 
TAACCGGTCG ACACCATAAG GGATCATAGA G 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: C. botulinum 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 
AGTACTAAGC TTTTCATACA TACTATTATA TACAATAGC 
(2) INFORMATION FOR '< SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE': NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: C. botulinum 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 



AAAAATAGAG CTCAATTATT TAATTTAGAA AGTAG 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: C. botulinum 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 
ACCATCGTCG ACAAACATTA TATTATTACT AGC 
(2) INFORMATION FOR, SEQ ID NO: 15: 

t 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: C. botulinum 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 
TATGGTGAAT TCATCTGGAC TTTACAGGAT ACTCAGG 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: C. botulinum 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 
ATTTACGTCG ACATATTTAT TTGGATC 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: C. botulinum 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 
GATAAGGAAT TCAATGAAAA AGAAATCAAA G 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: | nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: C. botulinum 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 
CTTCATGTCG ACTACTTGAC TTAGATTTCC 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: C. botulinum 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19 
AACATTGAAT TCAATTCAAG TTTGTATAGG GGG 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

j 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: C. botulinum 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20 
TCCATCGTCG ACAGGAATAA ATTCCCATGA GCTACC 
( 2 ) INFORMATION FOR j SEQ ID NO : 2 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 207 amino acids 

(B) TYPE: i amino acid 

(C) STRANDEDNESS: unknown 



(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: N-tenninal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: C. botulinum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

lie Lys Val Asn Asn Trp Asp Leu Phe Phe Ser Pro Ser Glu Asp Asn 
15 10 15 

Phe Thr Asn Asp Leu Asn Lys Gly Glu Glu lie Thr Ser Asp Thr Asn 
20 25 30 

lie Glu Ala Ala Glu Glu Asn lie Ser Leu Asp Leu lie Gin Gin Tyr 
35 40 45 

Tyr Leu Thr Phe Asn Phe Asp Asn Glu Pro Glu Asn lie Ser lie Glu 
50 55 60 

Asn Leu Ser Ser Asp lie lie Gly Gin Leu Glu Leu Met Pro Asn lie 
65 70 75 80 

Glu Arg Phe Pro Asn Gly Lys Lys Tyr Glu Leu Asp Lys Tyr Thr Met 

85 90 95 

Phe His Tyr Leu Arg Ala Gin Glu Phe Glu Mis Gly Lys Ser Arg lie 
100 105 110 

Ala Leu Thr Asn Ser Val Asn Glu Ala Leu Leu Asn Pro Ser Arg Val 
115 120 125 

Tyr Thr Phe Phe Ser Ser Asp Tyr Val Lys Lys Val Asn Lys Ala Thr 
130 135 140 

Glu Ala Ala Met Phe Leu Gly Trp Val Glu Gin Leu Val Tyr Asp Phe 
145 150 155 160 

Thr Asp Glu Thr Ser Glu Val Ser Thr Thr Asp Lys lie Ala Asp lie 

165 170 175 

Thr lie lie lie Pro Tyr lie Gly Pro Ala Leu Asn lie Gly Asn Met 
180 185 190 

Leu Tyr Lys Asp Asp Phe Val Gly Ala Leu lie Phe Ser Gly Ala 
195 200 205 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 140 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: N-terminal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: C. botulinum 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Leu Asn Ser Ser Leu Tyr Arg Gly Thr Lys Phe lie lie Lys Lys Tyr 
1,5 10 15 

Ala Ser Gly Asn Lys Asp Asn lie Val Arg Asn Asn Asp Arg Val Tyr 
20 25 30 

lie Asn Val Val Val Lys Asn Lys Glu Tyr Arg Leu Ala Thr Asn Ala 
35 40 45 

Ser Gin Ala Gly Val Glu Lys lie Leu Ser Ala Leu Glu lie Pro Asp 
50 55 60 

Val Gly Asn Leu Ser Gin Val Val Val Met Lys Ser Lys Asn Asp Gin 
65 70 75 80 

Gly lie Thr Asn Lys Cys Lys Met Asn Leu Gin Asp Asn Asn Gly Asn 
i 85 90 95 

Asp lie Gly Phe lie Gly Phe His Gin Phe Asn Asn lie Ala Lys Leu 
100 105 110 

Val Ala Ser Asn Trp Tyr Asn Arg Gin lie Glu Arg Ser Ser Arg Thr 
115 120 125 

Leu Gly Cys Ser Trp Glu Phe lie Pro Val Asp Asp 
130 135 140 



